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In  Johnson,  Kotz  and  Sorkin  (1980),^the  authors  derived  the 
distribution  of  the  number  of  items  observed  to  be  defective  in  samples 
from  a  finite  population,  when  detection  is  erroneous  with  a  nonzero 
probability.  77*  f  /  ’  4i-. 

We  extend  here  the  above  results  by  taking  into  account  incorrect 
identification  of  nondefectives  as  well  as  defectives.  Corresponding 
waiting  time  distributions  are  also  derived.  Furthermore,  the  case  of  a 
stratified  finite  population  corresponding,  for  example ,  to  defective 
features  of  differing  severity  is  considered.  Numerical  values 
illustrating  the  dependence  of  the  corresponding  probabilities  on  the  two 
Siisidentification^  parameters  are  presented. 


Key  Words  and  Phrases:  binomial  distribution;  compound  distributions; 

faulty  identification;  hypergeometric  distribution; 
sampling  inspection;  waiting  time;  incomplete 
identification. 
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1.  Introduction 

Johnson  et  al.  (1980)  have  discussed  sane  problems  arising  when 
attributes  inspection  is  "less  than  perfect".  They  considered,  in 
effect,  sampling  without  replacement  from  a  lot  of  size  N  containing  X 
defective  (or  nonconforming)  items,  when  inspection  detects  such  items 
with  probability  p.  Here  this  is  extended  (i)  by  allowing  for  a 
probability,  p' ,  of  erroneously  deciding  that  an  item  is  defective  when 
really  it  is  not  and  (ii)  by  stratifying  the  population  so  that 
inspection  error  probabilities  vary  from  stratum  to  stratum. 
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The  unconditional  r 


th 


factorial  moment  of 


Z  is 


V(rl(Z)  -  l  (T)pVr_j  ElY^^n-Y)^5] 
lTJ  j-0  J 

( r)  t 

-  i  tr)PiP'r-jx«)(N-x)(r-« .  a) 

Nlr)  j-o  3 

In  particular, 

E [Z]  •  {Xp  *  (N-X)p’)n/N  «  np  (2.1) 

Var(Z)  -  np(l-p)  -  |  (1-  £)(p-p’)2  ,  (2.2) 

where  p  »  N’*(Xp  ♦  (N-X)p').  The  conditional  probability  mass  function 
(pmf)  of  Z  given  Y  is 

Pr[Z  -  z | Y]  -  f  (Wd-p/’j  rtp'z'j(l-p’)n‘Y*2+j 

j-0  3  (3) 

(2  -  0,1, ...,Y)  , 

vtoere  (£)  •  0  if  a  <  b. 

Hence  the  unconditional  pmf  of  Z  is 

priz  -  z]  -  ft'1 1  (J)cJJ'.y)  ]o  (J) ^ a -p)y- a -  P*f ^  . 

y  j*°  (4) 


when  the  first  J  is  over  max(0,  n*N+X)  s  y  *  min(n,X). 

Numerical  values  can  be  obtained  expeditiously  if  adequate  tables  of 


t  . 


A 


■  V  Oi 


hyper geometric  probabilities 


and  binomial  probabilities 


b(y;  n,p)  «  (y)py(l-p)n*y 
are  available  from  the  formula 

Pr[Z  -  z]  -  l  h(y;  n,X,N)  £  b(j;  y,p)b(z-j;  n-y,  p')  .  (4) 

y  j*o 

Table  1  gives  some  exarples  of  distributions ,  each  with  sample  size 
n  *  10.  It  is  limited  by  space  considerations,  but  fuller  tables  have 
been  calculated  (for  other  sample  sizes  as  well  as  other  values  of  X  and 
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Note  that  for  p'  ■  0,  the  same  distribution  is  obtained  if  the 
values  of  n  and  X  are  interchanged. 

As  N  and  X  are  increased  proportionately  to  each  other  with  X/N  «  u> 
say,  the  other  parameters  (n,p,p')  remaining  constant,  the  distribution 
of  Z  tends  to  a  binomial  with  parameters  n,  XN’*p  +  (1  -XN*1)?’. 

Another  single  special  case  is  p  ■  p',  leading  to  a  binomial 
distribution  with  parameters  n,p  (whatever  the  values  of  X  and  N). 
However,  this  is  a  most  unlikely  situation  -  -it  would  correspond  to 
completely  useless  inspection,  unable  to  differentiate  between 
satisfactory  and  nonsatisfactory  items. 

The  distributions  shown  are  quite  sensitive  to  the  value  of  p' 

(false  condemnation)  because  the  ratio  X/N  is  relatively  small.  When 
the  proportion  of  nondefectives  (1  -  X/N)  is  lower,  p'  has  less  effect. 


3.  Stratified  Populations 

More  generally,  we  can  suppose  the  lot  divided  into  k  strata 
•• »  \  °f  sizes  Np^,.  . . ,  N^  Nj  *  N)  such  that  for  any 

chosen  individual  in  ik  ,  the  probability  of  "detection  as  defective" 
(whether  this  is  really  so  or  not)  is  p^.  The  different  strata  may,  for 
example,  correspond  to  actual  defects  of  differing  degrees  of  visibility. 
The  case  considered  in  Section  2  corresponds  to  k  “  2,  p^  ■  p,  p^  ■  p' , 
Nj  •  X,  N2  •  N  -  X. 

The  nuriber  observed  as  "defective"  in  a  random  sample  of  size  n  is 
then  distributed  as 

k 

Z~  l  Bin(Y.,p.)  a  Mult  Hypg.  (n;  N.,...,N. ;  N)  .  (5) 
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The  binomials  are  mutually  independent,  conditional  on  Y;  for  the 
multivariate  hypergeometric,  Pr[Y  -  jr]  -  (Jj)  1  (y-|)  y^  *  n), 

Then 


E[Z(r)|Y] 


r! 


^r  k 

n  r.! 
j“l  -1 


k  r.  (r  ) 

n  (P^  Y.  J  )  , 
j-1  J  J 


(6) 


where  J  denotes  simulation  over  nonnegative  integers  r^ . r^  such 

that  Tj  =  t.  Taking  expectations  with  respect  to  Y, 


E[Z(r)] 
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r! 
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In  particular, 


(7) 


E[Z]  «  J  l  N.p.  -  np 
N  j-1  J  J 


(8.1) 


and 


E[Z(Z-1)] 


N(N^l)  [jL  Nj(Nj'1)Pj  *  2  fiy  Vj-PjPj’l 

[(  I  N  P  )2  -  J  N  p2]  , 

N(N  -1)  j.J  J  J  j»l  J  J 


(8.2) 


whence 


(8.3) 


Var(Z)  e  np(l  -p)  -  I  N.(p  -p)2 

N(N-1J  J  J 

where  p  *  N  1  $  ..  N.p.. 

Alternative  and  instructive  formulae  for  the  variance  are 

Var(Z)  «  Sfjf-  p(l-p)  .  2§^}(P  -  N'1  ^  HjPj)  (8.3)' 

Var(Z)  •  p(l-p)  *  n/"'U  J  N.p  (1-p  )  .  (8.3)" 

N'2  N  (N  - 1)  j=l  J  J  J 

The  first  term  in  (8.3)'  and  (8.3)"  corresponds  to  the  variance  of  the 
(actual)  number  of  defectives  in  a  random  sample  (without  replacement)  of 
size  n  from  a  population  of  size  N  containing  Np  defectives.  It  follows 

that  the  variance  of  Z  is  not  less  than  this,  while  from  (8.3)  it  cannot 

exceed  np (1-p) --the  value  it  would  have  in  sampling  with  replacement 
from  the  same  population  (when  the  distribution  of  Z  would  be  binomial 
with  parameters  n,p).  This  will,  of  course,  also  be  a  good  approximation 
when  N  is  large. 

As  a  limiting  case,  we  might  have  NL  *  1  and  k  =  N--that  is,  each 
item  in  the  lot  would  have  its  own  probability  (p^-)  of  being  declared 
defective . 

Note  that  this  differs  from  a  model  in  which  there  is  supposed  to  be 
a  prior  distribution  of  the  probability  of  being  declared  defective,  and 
the  p..’  ?  are  regarded  as  realized  values  from  this  distribution.  For 
this  latter  model,  we  reach,  in  effect,  a  "with  replacement"  situation, 
as  distinguished  from  the  "without  replacement"  model  we  have  considered, 
with  the  distribution  of  Z  depending  on  the  specific  values  of  the  p^'s. 


13 


The  "with  replacement"  model  can  be  regarded  as  a  mixture  of 
"without  replacement"  models  —  the  latter  being  conditional  on  the  specific 
sets  of  values  Pj^,. . . ,  p^  The  average  variance  of  the  number  of  items 
declared  defective  for  the  "without  replacement"  will,  in  general,  be 
smaller  than  that  for  the  "with  replacement" -- because  the  latter  is 
increased  by  variation  among  the  p's.  Formally, 

Var(Z)  =  E^fVarCZ)^]  ♦  Var£(E[Z|g])  .  (9) 

4.  A  Waiting  Time  Distribution 

Suppose  now  that,  under  the  conditions  of  Section  3,  items  are 
inspected  one  at  a  time  (without  replacement)  until  a  predetermined 
number  a  of  items  have  been  assessed  as  "defective".  Denoting  by  M  the 
number  of  items  needed  to  attain  this  goal,  we  have  (cf.  Section  4  of 
Johnson  et  at.  (1980)) 

Pr(M>m)  *  Pr(Z<a)  m  «  1,2,..., N-l  (a  s  N)  ,  (10) 

where  Z  has  the  distribution  (5)  with  n  replaced  by  m.  Note  that  in  this 
case  a  can  exceed  the  actual  number  of  defectives  in  the  lot  because  an 
item  can  be  assessed  as  defective  even  though  it  is  not. 

The  distribution  of  M  is  not  proper  because  there  is  a  positive 
probability  that  even  when  all  N  items  in  the  lot  have  been  inspected, 
fewer  than  a  items  will  be  declared  "defective". 

In  order  to  derive  the  distribution  of  M,  we  first  consider  the 
possible  sets  of  decisions  for  the  N  items.  The  distribution  of  the 
number  (D)  of  those  which  will  be  "defective"  is  that  of  the  sun  of 
independent  binomial  variables  Bj,B2, . . . ,  B^  with  parameters 
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N 

(N1»P1)»(N2*P2)»***»(Nk»Pic)  respectively.  Given  D,  each  of  the  (D) 
possible  orderings  of  the  D  "defective”  and  (N-D)  "not  defective" 
decisions  is  equally  likely,  and  the  number  of  items  up  to  and  including 
the  a**1  defective  (M)  has  the  negative  hypergeometric  distribution 

Pr(M«m)  -  (p)'1  ("iJjCplJ)  (n  *  a,  a+1,...,  N-D+a)  ,  (11) 


provided  D  2  a. 

The  conditional  expected  value  of  M  given  D  (2  a)  is 

E[M|D]  =  a(N+l)/(D  +  l)  .  (12) 

If  we  neglect  the  possibility  that  D  is  less  than  a,  the  overall  expected 
value  of  M  is  approximately 


a(N+l)E[(D+l)_1]  «  t(E[D]  ♦  l}*1  +  (E[D]  ♦  l}'3  Var(D)]  (N+l)a 


-  a(N+l) (  l  N.p.  ♦  l)'1  [1  ♦  (  l  N.p.  ♦  1)'£  l  N  p.(l-p  )] 

j-1  ^  j=l  3  3  j.i  1  3  1 

»  ap_1[l  ♦  J{1  ♦  i  ♦  \  l  w.p.(l-p.)}]  ,  (13) 

N  p  -2  jel  3  J  3 


where  w.  *  N^/N  (the  proportion  in  the  j^1  stratum). 

We  evaluate  the  overall  variance  of  M  as 

Var(M)  -  Var(E[M|D])  ♦  E[Var(M|D)]  (cf.  (9)) 

-  a2(N+l)2  Var((D+l)"2)  +  a(N+l)E&^^^]  . 

[_(D+ir  (D+2)J 


After  straightforward  though  quite  tedious  calculations,  we  find 
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2  k 

Var(M)  •  ±-  l  u).p.(l-p.) 

Np4  j-1  }  J  3 


♦  ifil 

P 


LJl[i  +  I  ♦  JL{,3~P  l  u.p.Cl-p.)  -  (3+a)}] 
-2  N  Np(P(l-p)j-i  H  *V  / 


a(l-p) 

-2 

P 


-  k 


U  ♦  i  -  ♦  -3-^a^  z  v-d-p.)]  . 

N  ^  Jf2(l-p)  j-i  3  3  3 


(14) 


As  N  •*  •,  the  expected  value  of  M  tends  to  ap’*  and  the  variance 
--2  - 

tends  to  ap  (1-p).  These  are  the  mean  and  variance,  respectively,  of 
the  (negative  binomial)  waiting  time  distribution  for  occurrence  of  a 
"successes"  in  independent  trials  with  probability  of  success  equal  to  p 
at  each  trial. 

(14)  can  be  written 


Var(M)  * 


a  (1-p) 

1  ♦  i 

3+a  j 

ZW.p.(l-p.)' 
i  .  L  rv  vr 

P2  1 

N 

Np 

[  pd*p)  J 

N  -pO^P)  J 

Since  “>jPj  (1  _Pj)  *  P  -  Ij.j  <*>jPj  *  P  -  P2  -  P(l-P)  and  ^  >  a  ♦  1, 
it  follows  that 


a(l-p) 

-2 

P 


[1 


1  3+a, 

N  ‘  Np J 


Var(M)  s 


P2 


(1+  i) 

1  NJ 
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